Minable Data Warehouse

نویسندگان

  • David Morgan
  • Jai W. Kang
  • James M. Kang
چکیده

Data warehouses have been widely used in various capacities such as large corporations or public institutions. These systems contain large and rich datasets that are often used by several data mining techniques to discover interesting patterns. However, before data mining techniques can be applied to data warehouses, arduous and convoluted preprocessing techniques must be completed. Thus, we propose a minable data warehouse that integrates the preprocessing stage in a data mining technique within the cleansing and transformation process in a data warehouse. This framework will allow data mining techniques to be computed without any additional preprocessing steps. We present our proposed framework using a synthetically generated dataset and a classical data mining technique called Apriori to discover association rules within instant messaging datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Challenges in supporting the creation of data minable regulatory codes: a literature review

As standards and regulatory codes are issued by third party organizations and committees, the project organization can neither control the content of all standards that the projects should adhere to, nor negotiate or make changes to them that can make the project development easier. Moreover, large infrastructure projects require compliance with hundreds of standards of regulations coming from ...

متن کامل

Improvement of the Analytical Queries Response Time in Real-Time Data Warehouse using Materialized Views Concatenation

A real-time data warehouse is a collection of recent and hierarchical data that is used for managers’ decision-making by creating online analytical queries. The volume of data collected from data sources and entered into the real-time data warehouse is constantly increasing. Moreover, as the volume of input data to the real time data warehouse increases, the interference between online loading ...

متن کامل

ارائه مدل تلفیقی برای ارزیابی آمادگی سازمان ها جهت پیاده سازی سیستم انباره داده با استفاده ازتحلیل سلسله مراتبی

Enterprise Data Warehouse initiative is a high investment project. The adoption of Data Warehouse will be significantly different depending upon the level of readiness of an organization. Before implementation of Data Warehouse system in a firm, it is necessary to evaluate the level of the readiness of firm. A successful Data Warehouse assessment model requires a deep understanding of opportuni...

متن کامل

Utility of Ranking Warehouse Candidates in Workshop Locations Using UTAStar

Although the importance of locating in manufacturing and service companies is not a new issue, one of significance applications is to determine the appropriate location for warehouses in manufacturing workshops warehouses to the maintenance of materials or products. In any organizations, Finding the suitable site for warehouses establishments to increase customer service and efficiency is one o...

متن کامل

Advanced Data Warehouse in Telecommunication Industries

Data warehouse is the powerful tool in telecommunications industry for handling the massive amounts of data. It assists telecommunication companies in achieving a competitive advantage and higher profits. Theaim of this paper is to focus on data warehousing in telecommunication companies and why they need advanced data warehouse platforms and new technology. It compares traditional data warehou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009